Abstract

This article introduces a nonparametric warping model for functional data.
When the outcome of an experiment is a sample of curves, data can be seen as 
realizations of a stochastic process, which takes into account the small variations 
between the different observed curves. The aim of this work is to define a mean 
pattern which represents the main behaviour of the set of all the realizations. So 
we define the structural expectation of the underlying stochastic function. Then we 
provide empirical estimators of this structural expectation and of each individual 
warping function. Consistency and asymptotic normality for such estimators are 
proved.

1 Statistical model for functional phase variations

Functional data analysis deals with the analysis of experiments where one or several 
quantities are recorded during a time period for different individuals, resulting in a sample 
of observed curves. One of the main difficulties is that curves usually
not only present amplitude variability (a variation in the y-axis) but also time or phase
variability (a variation in the x-axis). Hence the classical cross-sectional mean does not
make sense, and the definition of an appropriate population mean is not even obvious.
Making sense of the common behaviour of a sample of curves, and finding a meaningful
mean curve in this context, is thus an important issue, called curve registration or the time
warping problem, which first appeared in the engineering literature in [11].

Several methods have been proposed over the years to estimate the mean pattern 
of a sample of curves. A popular method consists in i) first aligning the curves to a 
given template by warping the time axis, ii) then taking the mean of all the dewarped 
curves. Such methods are increasingly common in statistics, see pj for a review. A 
landmark registration methodology is proposed by [5] and further developed by [1]. A 
non parametric method is investigated in ^ and in [6], using local regressions. Dynamic 
time warping methodology is developed by [H]. An alternative approach is provided in [4],
where semi-parametric estimation of shifted curves is studied. But these methods imply
choosing a starting curve as a fixed point for the alignment process. This initial choice
may either bias the estimation procedure, or imply strong and restrictive identifiability
conditions.

In this work, we consider a second point of view. We define an archetype representing
the common behaviour of the sample curves directly from the data, without singling out
a particular curve. Such a method has the advantage of not imposing any technical
restrictions on the data, and so makes it possible to handle a large variety of cases. However, the
registration procedure and the common pattern have to be clearly defined.

We observe $m$ curves $f_i : [a,b] \to \mathbb{R}$, $i = 1,\dots,m$, at equidistant discrete times
$t_{ij} \in [a,b]$, $j = 1,\dots,n$. So the data can be written as

$$Y_{ij} = f_i(t_{ij}), \quad i = 1,\dots,m,\ j = 1,\dots,n. \qquad (1)$$

So, the registration problem aims at finding a mean pattern $f$ and warping functions $h_i$
which align the observed curves, i.e. such that $\forall i = 1,\dots,m$, $f = f_i \circ h_i$. Hence each curve
is obtained by warping the original curve $f$ using the warping functions $h_i$.

Defining the registration operator is a difficult task. In this paper we propose a random
warping procedure which takes into account the variability of the deformation as a random
effect. So, we assume that there exists a random process $H$ such that the data are i.i.d.
realizations of this process. Let $H$ be this warping stochastic process defined as

$$H : \Omega \to C([a,b]), \qquad w \mapsto H(w,\cdot), \qquad (2)$$

where $(\Omega, \mathcal{A}, P)$ is an unknown probability space, and $(C([a,b]), \|\cdot\|_\infty, \mathcal{B})$ is the set of all
real continuous functions defined on the interval $[a,b]$, equipped with the uniform norm
and with its Borel algebra. Consider $h_1,\dots,h_m$ i.i.d. realizations of the process $H$. The function $h_i$
warps a mean pattern $f$ onto the $i$-th observation curve $f_i$. Hence, model (1) can be
written as

$$Y_{ij} = f_i(t_{ij}) = f \circ h_i^{-1}(t_{ij}). \qquad (3)$$

For the sake of simplicity, we will write $f_i(t) = f \circ h_i^{-1}(t)$, for all $i \in \{1,\dots,m\}$. We point
out that $h_i^{-1}$ is well defined since the warping processes are assumed to be continuous
increasing functions.

Under general assumptions (2), model (3) is not identifiable. More precisely, the
unknown function $f$ and the unknown warping process $H$ cannot be estimated. Indeed,
if $h : [a,b] \to [a,b]$ is an increasing continuous function, with $h(a) = a$ and $h(b) = b$, then,
for all $i \in \{1,\dots,m\}$ and all $j \in \{1,\dots,n\}$, we have that $Y_{ij} = f \circ h^{-1} \circ h \circ h_i^{-1}(t_{ij})$.
Hence, the function $f \circ h^{-1}$, associated with the warping process $H \circ h^{-1}$, is also a solution
of model (3).

The aim of this paper is to build a new kind of pattern which represents the common
functional feature of the data but which can still be estimated together with the warping
procedure. For this, let $\phi(\cdot)$ be the expectation of the warping process and define the
structural expectation $f_{ES}$ as

$$f_{ES} := f \circ \phi^{-1}.$$

The structural expectation is obviously not the function $f$, but the function $f$ composed
with the inverse of the expectation of $H$. Hence it can be seen as the mean warping
of the function $f$ by the stochastic process $H$. In this article, we aim at studying
the properties of the structural expectation, and finding an estimator of the structural
expectation and of the warping paths $h_i$.
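
A short computation, given here only as a sketch, suggests why the structural expectation is not affected by the identifiability issue described above. It assumes that $\phi$ is invertible and that $h$ is an increasing continuous bijection of $[a,b]$; the tilde notation is introduced for this illustration only. Replacing $(f, H)$ by the equivalent pair $(\tilde f, \tilde H) = (f \circ h^{-1}, H \circ h^{-1})$ gives $\tilde\phi = E(H \circ h^{-1}) = \phi \circ h^{-1}$, so that

$$\tilde f \circ \tilde\phi^{-1} = f \circ h^{-1} \circ (\phi \circ h^{-1})^{-1} = f \circ h^{-1} \circ h \circ \phi^{-1} = f \circ \phi^{-1} = f_{ES}.$$

Under these assumptions, both parametrizations lead to the same structural expectation.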

The article is organized as follows. In Section 1 we introduce a warping functional
model with a stochastic phase variation, and we introduce the structural expectation.
In Section 2 we define empirical estimators of the structural expectation and of the
individual warping functions. Asymptotic properties of these estimators are investigated.
Proofs are postponed to Section A. Section 3 investigates some extensions of the proposed
methodology, in particular to the case of noisy nonincreasing functions. The results of a
simulation study are reported in Section 4. There, we also apply the proposed estimators
to a real data set.

2 Theoretical study of a particular case: warping of strictly 
increasing functions 

First, consider the case where $f$ is strictly increasing. Hence the inverse function $f^{-1}$
exists and is also strictly increasing. Moreover, a phase warping of function $f$ (i.e. on the
x-axis) corresponds to an amplitude warping of function $f^{-1}$ (i.e. on the y-axis). We propose
estimators of both the inverse of the structural expectation $f_{ES}^{-1} = \phi \circ f^{-1}$, and also of
each individual warping function $\phi \circ h_i^{-1}$, for all $i \in \{1,\dots,m\}$, and finally an estimator of the
structural expectation $f_{ES} = f \circ \phi^{-1}$.

Note that all the asymptotic results are taken with respect to $m$ and $n$, so we recall
that $u(m,n) \to c$ as $m,n \to \infty$ if, and only if, we have

$$\forall \varepsilon > 0,\ \exists (m_0,n_0) \in \mathbb{N}^2,\ \forall (m,n) \in \mathbb{N}^2,\quad m > m_0 \text{ and } n > n_0 \Rightarrow |u(m,n) - c| < \varepsilon.$$

We will assume the following conditions on the warping process in order to define a good
registration procedure.

The warping process does not change the timeline (no time inversion) and leaves the two
extreme points fixed, so for almost all $w \in \Omega$, assume that

i) $H(w,\cdot)$ is an increasing function,

ii) $H(w,a) = a$ and $H(w,b) = b$.

The following proposition introduces respectively the expectation, the second order
moment and the covariance function of $H$.

Proposition 2.1. Under assumption (2), the expectation $\phi(\cdot)$, the second order moment
$\gamma(\cdot)$ and the covariance function $r(\cdot,\cdot)$ of the stochastic process $H$ are well defined. $\phi$
and $\gamma$ are also continuous increasing functions. Moreover, we have $\phi(a) = a$, $\phi(b) = b$,
$\gamma(a) = a^2$ and $\gamma(b) = b^2$. As a consequence, we have that $\mathrm{var}\,H(\cdot,a) = \mathrm{var}\,H(\cdot,b) = 0$.

Proof. The process $H$ is bounded and increasing. As a consequence, $\phi$ and $\gamma$ exist.
Moreover, $H$ is a continuous increasing process, which leads to continuous and increasing
first and second order moments. □

In order to prove asymptotic results, the following technical assumptions on the warping
process $H$ and on the function $f$ are needed:

Assumptions

1. There exists a constant $C_1 > 0$ such that, for all $(s,t) \in [f(a), f(b)]^2$, we have

$$E\,|H(s) - EH(s) - (H(t) - EH(t))|^2 \le C_1 |s - t|^2. \qquad (4)$$

2. There exists a constant $C_2 > 0$ such that, for all $(s,t) \in [f(a), f(b)]^2$, we have

$$|f^{-1}(s) - f^{-1}(t)|^2 \le C_2 |s - t|^2. \qquad (5)$$

3. There exists a constant $C_3 > 0$ such that, for all $\omega \in \Omega$, for all $(s,t) \in [a,b]^2$, we
have

$$|H^{-1}(\omega,s) - H^{-1}(\omega,t)|^2 \le C_3 |s - t|^2. \qquad (6)$$

2.1 Estimator of the structural expectation $f_{ES}$

Since $f_i = f \circ h_i^{-1}$, we have $f_i^{-1} = h_i \circ f^{-1}$. Hence $E(f_i^{-1}) = (E(H)) \circ f^{-1}$. It therefore
seems natural to consider the mean of the functions $(f_i^{-1})_{i=1,\dots,m}$ in order to estimate the
inverse of the structural expectation. For all $y \in [f(a), f(b)]$, and for all $i \in \{1,\dots,m\}$,
define

$$j_i(y) = \arg\min_{j \in \{1,\dots,n\}} |Y_{ij} - y| \quad\text{and}\quad T_i(y) := t_{i j_i(y)}. \qquad (7)$$

Then, the empirical estimator of the inverse of the structural expectation is defined by

$$\widehat{f^{-1}_{ES}}(y) = \frac{1}{m} \sum_{i=1}^{m} T_i(y). \qquad (8)$$
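
As an illustration, estimators (7) and (8) can be sketched in a few lines of Python with NumPy. This is a minimal sketch under stated assumptions, not the authors' implementation: the function name `inverse_structural_expectation` and the array layout (one row per curve) are hypothetical choices.

```python
import numpy as np

def inverse_structural_expectation(Y, t, y):
    """Sketch of estimators (7)-(8) at a single level y.

    Y : (m, n) array, Y[i, j] = f_i(t[i, j]); each row is assumed increasing.
    t : (m, n) array of observation times, or (n,) if all curves share a grid.
    y : scalar level in [f(a), f(b)].
    """
    Y = np.asarray(Y, dtype=float)
    t = np.broadcast_to(np.asarray(t, dtype=float), Y.shape)
    # (7): for each curve i, index j_i(y) of the observation closest to y ...
    j = np.argmin(np.abs(Y - y), axis=1)
    # ... and the corresponding observation time T_i(y).
    T = t[np.arange(Y.shape[0]), j]
    # (8): average the times T_i(y) over the m curves.
    return T.mean()
```

For instance, with $f$ the identity on $[0,1]$ and two hypothetical warpings $h_1(t) = \sqrt{t}$ and $h_2(t) = t^2$, the value returned at $y = 0.25$ should be close to $(0.5 + 0.0625)/2$.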

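The following theorem provides consistency and asymptotic normality of estimator (8).

Theorem 2.1 (Consistency of the inverse of the structural expectation). Under Assumption (2),

$$\left\| \widehat{f^{-1}_{ES}} - f^{-1}_{ES} \right\|_\infty \xrightarrow[m,n\to\infty]{} 0.$$

Moreover, let $n = m^{\frac{1}{2}+\alpha}$ with $\alpha > 0$, and assume Conditions (4) and (5). Then,

$$\sqrt{m}\left( \widehat{f^{-1}_{ES}} - f^{-1}_{ES} \right) \xrightarrow[m\to\infty]{\mathcal{D}} G,$$

where $G$ is a centered Gaussian process with covariance given by: for all $(s,t) \in [f(a), f(b)]^2$,

$$\mathrm{cov}(G(s), G(t)) = r(f^{-1}(s), f^{-1}(t)).$$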

From (7) and (8), $\widehat{f^{-1}_{ES}}$ is an increasing step function with jumps occurring at, say,
$K(m,n)$ points $v_1,\dots,v_{K(m,n)}$ in $[f(a), f(b)]$, such that $f(a) = v_0 < v_1 < \dots < v_{K(m,n)} < f(b)$.
Hence for all $y \in [f(a), f(b)] \setminus (v_k)_{k \in \mathcal{K}}$ ($\mathcal{K} = \{0,\dots,K(m,n)+1\}$),
$\widehat{f^{-1}_{ES}}(y)$ can be expressed as

$$\widehat{f^{-1}_{ES}}(y) = \sum_{k=0}^{K(m,n)} u_k \mathbf{1}_{(v_k, v_{k+1})}(y),$$

with $a = u_0 < u_1 < \dots < u_{K(m,n)-1} < u_{K(m,n)} = b$. A natural estimator of the structural
expectation $f_{ES}$ is then obtained by linear interpolation between the points $(u_k, v_k)$. For
all $t \in [a,b]$, define

$$\widehat{f_{ES}}(t) = \sum_{k=0}^{K(m,n)-1} \left( v_k + \frac{v_{k+1} - v_k}{u_{k+1} - u_k}(t - u_k) \right) \mathbf{1}_{[u_k, u_{k+1})}(t) + v_{K(m,n)} \mathbf{1}_{\{b\}}(t).$$

Note that by construction, this estimator is strictly increasing and continuous on $[a,b]$.
The following theorem states its consistency.

Theorem 2.2 (Consistency of the estimator of the Structural Expectation). Under Assumption (2),
we have

$$\left\| \widehat{f_{ES}} - f_{ES} \right\|_\infty \xrightarrow[m,n\to+\infty]{} 0.$$
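
The interpolation step can be sketched as follows. This is only an approximate illustration: instead of locating the jump points $v_k$ exactly, the hypothetical helper `structural_expectation_knots` scans a finite grid of levels (an assumption of this sketch, controlled by the made-up parameter `n_levels`) and records the first level at which each value $u_k$ of the step function is reached.

```python
import numpy as np

def structural_expectation_knots(Y, t, n_levels=2000):
    """Approximate knots (u_k, v_k) of the interpolated estimator.

    Y : (m, n) array of increasing curves, t : (m, n) or (n,) times.
    Returns increasing arrays u (times) and v (levels).
    """
    Y = np.asarray(Y, dtype=float)
    t = np.broadcast_to(np.asarray(t, dtype=float), Y.shape)
    # Assumption: the step function is scanned on a regular grid of levels.
    levels = np.linspace(Y.min(), Y.max(), n_levels)
    # Closest observation to each level, for each curve, as in (7).
    j = np.argmin(np.abs(Y[:, None, :] - levels[None, :, None]), axis=2)
    T = np.take_along_axis(t, j, axis=1)      # shape (m, n_levels)
    inv = T.mean(axis=0)                      # step function (8) on the grid
    # Distinct values u_k and the first level v_k at which each is attained.
    u, first = np.unique(inv, return_index=True)
    v = levels[first]
    return u, v
```

The estimate at a time `x` is then obtained by linear interpolation through the knots, for example with `np.interp(x, u, v)`.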



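Obtaining confidence bands for $f^{-1}_{ES}$ is useful for describing and visualizing the
uncertainty in the estimate of $f^{-1}_{ES}$. This requires finding an estimator of
$\mathrm{var}(G(y)) = \gamma \circ f^{-1}(y) - \{f^{-1}_{ES}(y)\}^2$, $y \in [f(a), f(b)]$.

Lemma 2.3. Let $y \in [f(a), f(b)]$. Let $\widehat{\gamma \circ f^{-1}}(y) = \frac{1}{m} \sum_{i=1}^{m} T_i^2(y)$, with $T_i(\cdot)$ defined as in
(7). Then,

$$\widehat{\gamma \circ f^{-1}}(y) - \left\{ \widehat{f^{-1}_{ES}}(y) \right\}^2 \xrightarrow[m,n\to\infty]{} \mathrm{var}(G(y)).$$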

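Proof of this lemma is given in Section 4.2. Combining this lemma with the asymptotic
normality result stated by Theorem 2.1 yields a pointwise asymptotic confidence band for
$f^{-1}_{ES}$.

Corollary 2.4. An asymptotic $(1-\alpha)$-level pointwise confidence band for $f^{-1}_{ES}$ is given by

$$\left[ \widehat{f^{-1}_{ES}}(y) - u_{1-\frac{\alpha}{2}} \sqrt{\frac{\widehat{\mathrm{var}}(G(y))}{m}},\ \widehat{f^{-1}_{ES}}(y) + u_{1-\frac{\alpha}{2}} \sqrt{\frac{\widehat{\mathrm{var}}(G(y))}{m}} \right],$$

where $\widehat{\mathrm{var}}(G(y)) = \widehat{\gamma \circ f^{-1}}(y) - \{\widehat{f^{-1}_{ES}}(y)\}^2$ and $u_{1-\frac{\alpha}{2}}$ is the quantile of order $1 - \frac{\alpha}{2}$ of the
standard normal distribution.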
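
The band of Corollary 2.4 can be computed directly from the times $T_i(y)$ of (7). The sketch below assumes these times have already been collected in an array for a fixed level $y$; the function name `pointwise_band` is hypothetical, and the variance estimate is clipped at zero to guard against rounding, which is a choice of this sketch rather than part of the corollary.

```python
from statistics import NormalDist

import numpy as np

def pointwise_band(T, alpha=0.05):
    """Pointwise (1 - alpha) band at one level y, from T = (T_1(y), ..., T_m(y))."""
    T = np.asarray(T, dtype=float)
    m = T.size
    center = T.mean()                          # estimator (8)
    var_hat = (T ** 2).mean() - center ** 2    # variance estimate of Lemma 2.3
    var_hat = max(var_hat, 0.0)                # assumption: clip rounding noise
    u = NormalDist().inv_cdf(1 - alpha / 2)    # standard normal quantile
    half_width = u * np.sqrt(var_hat / m)
    return center - half_width, center + half_width
```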

Note that the construction of a simultaneous asymptotic confidence band for $f^{-1}_{ES}$
would require determination of the distribution of $\sup_{y \in [f(a), f(b)]} |G(y)|$. This, however,
falls beyond the scope of this paper.

2.2 Estimator of an individual warping function 

In a warping framework, it is necessary to estimate the mean pattern but also the
individual warping functions, i.e. $(h_i^{-1})_{i \in \{1,\dots,m\}}$. As previously, we cannot directly
estimate the functions $h_i^{-1}$, only the functions $\phi \circ h_i^{-1}$.

Let $i_0 \in \{1,\dots,m\}$. Now we want to compute for all $i \ne i_0$, $T_i^*(t) = f_i^{-1} \circ f_{i_0}(t)$. For
this, define

$$j_0(t) = \arg\min_{j \in \{1,\dots,n\}} |t_{i_0 j} - t|. \qquad (9)$$

This point is the observation time for the $i_0$-th curve which is the closest to $t$. Note that the
index $j_0(t)$ depends on $i_0$ but, for the sake of simplicity, we drop this index in the notations.
Then $\forall t \in [a,b]$, and $\forall i \in \{1,\dots,m\} \setminus \{i_0\}$, compute

$$T_i(t) = \arg\min_{t_{ij},\ j \in \{1,\dots,n\}} |Y_{ij} - Y_{i_0 j_0(t)}|, \qquad (10)$$

an estimate of $T_i^*$. Then for a fixed $i_0$, noting that $T_i^* = h_i \circ h_{i_0}^{-1}$, we can see that an
empirical estimator of each individual warping function $\phi \circ h_{i_0}^{-1}$ is given by

$$\widehat{\phi \circ h_{i_0}^{-1}}(t) := \frac{1}{m-1} \sum_{i=1,\ i \ne i_0}^{m} T_i(t). \qquad (11)$$

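The following theorem asserts consistency and asymptotic normality of this estimator.

Theorem 2.5. Under assumption (2),

$$\left\| \widehat{\phi \circ h_{i_0}^{-1}} - \phi \circ h_{i_0}^{-1} \right\|_\infty \xrightarrow[m,n\to\infty]{} 0.$$

Let $n = m^{\frac{1}{2}+\alpha}$ (with $\alpha > 0$) and assume that (4) and (6) hold. Then
$\sqrt{m}\,(\widehat{\phi \circ h_{i_0}^{-1}} - \phi \circ h_{i_0}^{-1})$ converges weakly to a centered Gaussian process $Z$,

$$\sqrt{m}\left( \widehat{\phi \circ h_{i_0}^{-1}} - \phi \circ h_{i_0}^{-1} \right) \xrightarrow[m\to\infty]{\mathcal{D}} Z,$$

with covariance function defined for all $(s,t) \in [a,b]^2$ by

$$\mathrm{cov}(Z(s), Z(t)) = r(h_{i_0}^{-1}(s), h_{i_0}^{-1}(t)).$$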
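
Steps (9) to (11) can be sketched as below. The sketch assumes that the estimator averages the times $T_i(t)$ over the curves $i \ne i_0$; the function name `individual_warping` and the array layout are hypothetical.

```python
import numpy as np

def individual_warping(Y, t, i0, x):
    """Sketch of (9)-(11): estimate of phi o h_{i0}^{-1} at time x.

    Y : (m, n) array of increasing curves, t : (m, n) or (n,) times.
    """
    Y = np.asarray(Y, dtype=float)
    t = np.broadcast_to(np.asarray(t, dtype=float), Y.shape)
    m = Y.shape[0]
    # (9): observation time of curve i0 closest to x.
    j0 = np.argmin(np.abs(t[i0] - x))
    target = Y[i0, j0]
    # (10): for every curve, the time at which it is closest to that value.
    j = np.argmin(np.abs(Y - target), axis=1)
    T = t[np.arange(m), j]
    # (11): average over the other curves (assumption: i0 is excluded).
    others = np.arange(m) != i0
    return T[others].mean()
```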



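We may also compute confidence bands for $\phi \circ h_{i_0}^{-1}$, based on a consistent estimator
of $\mathrm{var}(Z(t)) = \gamma \circ h_{i_0}^{-1}(t) - \{\phi \circ h_{i_0}^{-1}(t)\}^2$.

Lemma 2.6. Let $t \in [a,b]$. Let $\widehat{\gamma \circ h_{i_0}^{-1}}(t) = \frac{1}{m-1} \sum_{i=1,\ i \ne i_0}^{m} T_i^2(t)$, with $T_i(\cdot)$ defined by (9) and
(10). Then

$$\widehat{\gamma \circ h_{i_0}^{-1}}(t) - \left\{ \widehat{\phi \circ h_{i_0}^{-1}}(t) \right\}^2 \xrightarrow[m,n\to+\infty]{} \mathrm{var}(Z(t)).$$

Proof of this lemma relies on the same arguments as proof of Lemma 2.3 and is outlined
in Section 4.2. A pointwise asymptotic confidence band for $\phi \circ h_{i_0}^{-1}$ is now given by

Corollary 2.7. An asymptotic $(1-\alpha)$-level pointwise confidence band for $\phi \circ h_{i_0}^{-1}$ is given
by

$$\left[ \widehat{\phi \circ h_{i_0}^{-1}}(t) - u_{1-\frac{\alpha}{2}} \sqrt{\frac{\widehat{\mathrm{var}}(Z(t))}{m}},\ \widehat{\phi \circ h_{i_0}^{-1}}(t) + u_{1-\frac{\alpha}{2}} \sqrt{\frac{\widehat{\mathrm{var}}(Z(t))}{m}} \right],$$

where $\widehat{\mathrm{var}}(Z(t)) = \widehat{\gamma \circ h_{i_0}^{-1}}(t) - \{\widehat{\phi \circ h_{i_0}^{-1}}(t)\}^2$.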



3 Extensions to the general case 

In the preceding part, we studied the asymptotic behaviour of a new warping methodology.
However, drastic restrictions on the class of functions are needed, mostly dealing
with monotonicity of the observed functions and with a noise-free model. In this part, we get
rid of such assumptions and provide a practical way of handling more realistic observations.

First, note that the assumptions $H(a) = a$ and $H(b) = b$ can be weakened into the
following assumption

ii') $H^{-1}(\cdot,a)$ and $H^{-1}(\cdot,b)$ are compactly supported random variables such that

$$\sup H^{-1}(w,a) \overset{a.s.}{<} \inf H^{-1}(w,b).$$

Then, we focus on the other assumptions.

3.1 Breaking monotonicity

The main idea is to build a transformation $\mathcal{G}$ which turns a nonmonotone function
into a monotone function while preserving the warping functions. For the sake of simplicity,
the observation times will be taken equal for all the curves, hence $t_{ij}$ will be denoted $t_j$.
Hence, the observations

$$Y_{ij} = f \circ h_i^{-1}(t_j), \quad i = 1,\dots,m,\ j = 0,\dots,n,$$

are transformed into

$$Z_{ij} = \mathcal{G}(f_i)(t_j) := g \circ h_i^{-1}(t_j), \quad i = 1,\dots,m,\ j = 0,\dots,n, \qquad (12)$$

where $g$ is a monotone function. So, estimating the warping process of the monotonized
model can be used to estimate the real warping functions, and then align the original
observations $Y_{ij}$ to their structural mean.

For this, consider a nonmonotone function $f : [a,b] \to \mathbb{R}$ and let $a = s_0 <
s_1 < \dots < s_r < s_{r+1} = b$ be the different variational change points, in the sense that
$\forall k \in \{0,\dots,r-1\}$,

$$\forall (t_1,t_2) \in\, ]s_k, s_{k+1}[,\ \forall (t_3,t_4) \in\, ]s_{k+1}, s_{k+2}[,\quad t_1 < t_2 \text{ and } t_3 < t_4 \Rightarrow (f(t_1) - f(t_2))(f(t_3) - f(t_4)) < 0. \qquad (13)$$

So, consider functional warping over the subset

$$\mathcal{F} = \{ f : [a,b] \to \mathbb{R} \text{ such that (13) holds} \}.$$

Let $\pi :\, ]a,b[\, \setminus \{s_1,\dots,s_r\} \to \{-1, 1\}$ be a tool function which indicates whether, around
a given point $t$, the function $f$ is increasing or decreasing, defined by

$$\pi : s_{l(t)} < t < s_{l(t)+1} \mapsto \pi(t) = \begin{cases} -1 & \text{if } f(s_{l(t)}) - f(s_{l(t)+1}) > 0, \\ \phantom{-}1 & \text{if } f(s_{l(t)}) - f(s_{l(t)+1}) < 0, \end{cases}$$

with $l(t) \in \{0,\dots,r\}$.
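Monotonizing Operator

Define for all $f \in \mathcal{F}$ the operator $\mathcal{G}(\cdot, f) : t \in\, ]a,b[\, \setminus \{s_0,\dots,s_{r+1}\} \mapsto \mathcal{G}(t,f)$ by

$$\mathcal{G}(t,f) = f(t)\pi(t) - \sum_{k=0}^{r} \pi(t) f(s_k) \mathbf{1}_{]s_k, s_{k+1}[}(t) + f(s_0) + \sum_{k=1}^{r} |f(s_{k-1}) - f(s_k)|\, \mathbf{1}_{]s_k, b[}(t),$$

and, for all $k \in \{0,\dots,r+1\}$,

$$\mathcal{G}(s_k, f) = f(a) + \sum_{l=1}^{k} |f(s_{l-1}) - f(s_l)|,$$

with the notation $\sum_{l=1}^{0} |f(s_{l-1}) - f(s_l)| = 0$.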

By construction, it is obvious that $t \mapsto \mathcal{G}(t,f)$ is strictly increasing. Moreover, the
following proposition proves that the warping functions remain unchanged.

Proposition 3.1. Consider $f \in \mathcal{F}$ and warping functions $h_i$, $i = 1,\dots,m$. Set $\forall i =
1,\dots,m$, $\mathcal{G}(\cdot, f_i) = g_i(\cdot)$. We have $g_i = g \circ h_i^{-1}$.

Proof. For $i \in \{1,\dots,m\}$, let us prove that $f_i \in \mathcal{F}$. Using (13), consider the change
points $(s_k)_k$ and set $s_k^i = h_i(s_k)$. For $k \in \{0,\dots,r-1\}$, consider

$$(t_1,t_2) \in\, ]s_k^i, s_{k+1}^i[,\ (t_3,t_4) \in\, ]s_{k+1}^i, s_{k+2}^i[ \text{ such that } t_1 < t_2 \text{ and } t_3 < t_4,$$

then,

$$(f_i(t_1) - f_i(t_2))(f_i(t_3) - f_i(t_4)) = (f \circ h_i^{-1}(t_1) - f \circ h_i^{-1}(t_2))(f \circ h_i^{-1}(t_3) - f \circ h_i^{-1}(t_4)).$$

Since $h_i^{-1}$ is strictly increasing, we get $s_k < h_i^{-1}(t_1) < h_i^{-1}(t_2) < s_{k+1}$ and $s_{k+1} < h_i^{-1}(t_3) <
h_i^{-1}(t_4) < s_{k+2}$. Hence

$$(f_i(t_1) - f_i(t_2))(f_i(t_3) - f_i(t_4)) < 0,$$

which yields that $f_i \in \mathcal{F}$. So define $g_i = \mathcal{G}(f_i)$. For all $t \in\, ]a,b[\, \setminus \{s_1^i,\dots,s_r^i\}$, we get

$$g_i(t) = f_i(t)\pi(t,f_i) - \sum_{k=0}^{r} \pi(t,f_i) f_i(s_k^i) \mathbf{1}_{]s_k^i, s_{k+1}^i[}(t) + f_i(s_0^i) + \sum_{k=1}^{r} |f_i(s_{k-1}^i) - f_i(s_k^i)|\, \mathbf{1}_{]s_k^i, b[}(t).$$

But

$$f_i(t) = f \circ h_i^{-1}(t),$$

$$\pi(t, f_i) = \pi(t, f \circ h_i^{-1}) = \pi(h_i^{-1}(t), f),$$

which implies that

$$g_i = g \circ h_i^{-1}.$$

□



The discretization implies however that the $Z_{ij}$ cannot be computed directly since
the functions $f_i$, $i = 1,\dots,m$, are known on the grid $t_j$, $j = 0,\dots,n$, while the values $s_k$
and $s_k^i$ are unknown. So consider estimates of $Z_{ij}$ defined as follows

$$\hat Z_{i0} = Y_{i0},$$

$$\forall j \in \{1,\dots,n\},\quad \hat Z_{ij} = \hat Z_{i,j-1} + |Y_{ij} - Y_{i,j-1}|. \qquad (14)$$
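
Recursion (14) amounts to accumulating the absolute increments of each observed curve. A minimal NumPy sketch, with the hypothetical function name `monotonize`, could read:

```python
import numpy as np

def monotonize(Y):
    """Estimates of Z_ij from (14): cumulative absolute increments per curve.

    Y : (n + 1,) or (m, n + 1) array of observations on a common grid.
    """
    Y = np.asarray(Y, dtype=float)
    first = Y[..., :1]                                # Z_i0 = Y_i0
    increments = np.abs(np.diff(Y, axis=-1))          # |Y_ij - Y_i,j-1|
    tail = first + np.cumsum(increments, axis=-1)     # Z_ij for j >= 1
    return np.concatenate([first, tail], axis=-1)
```

For example, the observations `[1, 3, 2, 5]` are mapped to `[1, 3, 4, 7]`, which is nondecreasing by construction.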

The following proposition proves the consistency of such an estimation procedure.
Proposition 3.2. For $f \in \mathcal{F}$, $i \in \{1,\dots,m\}$ and $t \in [a,b]$, define a sequence $j(n)$ such
that $t_{j(n)} \xrightarrow[n\to\infty]{} t$. Then,

$$\hat Z_{ij(n)} - Z_{ij(n)} \xrightarrow[n\to\infty]{} 0.$$



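Proof. Set $t \in\, ]a,b[\, \setminus \{s_1^i,\dots,s_r^i\}$, and $l \in \{0,1,\dots,r\}$ such that $t \in\, ]s_l^i, s_{l+1}^i[$. Consider
$j(n)$ with $t_{j(n)} \to t$, then $\exists n_0 \in \mathbb{N}$ such that $\forall n \in \mathbb{N}$, $n \ge n_0 \Rightarrow t_{j(n)} \in\, ]s_l^i, s_{l+1}^i[$. $\forall n \ge n_0$, we have

$$\hat Z_{ij(n)} = \hat Z_{i,j(n)-1} + |Y_{ij(n)} - Y_{i,j(n)-1}| = Y_{i0} + \sum_{k=1}^{j(n)} |Y_{ik} - Y_{i,k-1}|.$$

Moreover,

$$Z_{ij(n)} = g \circ h_i^{-1}(t_{j(n)}) = \pi(t_{j(n)}, f_i)\left( f_i(t_{j(n)}) - f_i(s_l^i) \right) + f_i(s_0^i) + \sum_{k=1}^{l} |f_i(s_{k-1}^i) - f_i(s_k^i)|,$$

which yields that

$$Z_{ij(n)} = \pi(t_{j(n)}, f_i)\left( Y_{ij(n)} - f_i(s_l^i) \right) + Y_{i0} + \sum_{k=1}^{l} |f_i(s_{k-1}^i) - f_i(s_k^i)| = A + Y_{i0} + B.$$

Write $\forall k = 1,\dots,l+1$, $s_{k-1}^i \le t_{j_k - p_k} < \dots < t_{j_k} \le s_k^i$. So we get

$$|f_i(s_{k-1}^i) - f_i(s_k^i)| = |f_i(s_{k-1}^i) - f_i(t_{j_k - p_k})| + \sum_{q=1}^{p_k} |f_i(t_{j_k - q}) - f_i(t_{j_k - q + 1})| + |f_i(t_{j_k}) - f_i(s_k^i)|$$

$$= |f_i(s_{k-1}^i) - Y_{i,j_k - p_k}| + \sum_{q=1}^{p_k} |Y_{i,j_k - q} - Y_{i,j_k - q + 1}| + |Y_{ij_k} - f_i(s_k^i)|.$$

Hence

$$B = \sum_{k=1}^{l} |f_i(s_{k-1}^i) - f_i(s_k^i)|$$

$$= \sum_{k=1}^{l} |f_i(s_{k-1}^i) - Y_{i,j_k - p_k}| + \sum_{k=1}^{l} |Y_{ij_k} - f_i(s_k^i)| + \sum_{k=1}^{l} \sum_{q=1}^{p_k} |Y_{i,j_k - q} - Y_{i,j_k - q + 1}|$$

$$= \sum_{k=1}^{l} |f_i(s_{k-1}^i) - Y_{i,j_k - p_k}| + \sum_{k=1}^{l} |Y_{ij_k} - f_i(s_k^i)| + \sum_{k=1}^{j_l} |Y_{ik} - Y_{i,k-1}| - \sum_{k=1}^{l-1} |Y_{i,j_{k+1} - p_{k+1}} - Y_{i,j_{k+1} - p_{k+1} - 1}| - |Y_{i1} - Y_{i0}|.$$

With the same ideas, we can write

$$A = |Y_{ij(n)} - f_i(s_l^i)| = |f_i(s_l^i) - Y_{i,j_{l+1} - p_{l+1}}| + \sum_{q = j_{l+1} - p_{l+1} + 1}^{j(n)} |Y_{iq} - Y_{i,q-1}|.$$

As a result,

$$Z_{ij(n)} - \hat Z_{ij(n)} = \sum_{k=1}^{l+1} |f_i(s_{k-1}^i) - Y_{i,j_k - p_k}| + \sum_{k=1}^{l} |Y_{ij_k} - f_i(s_k^i)| - \sum_{k=1}^{l} |Y_{i,j_{k+1} - p_{k+1}} - Y_{i,j_{k+1} - p_{k+1} - 1}| - |Y_{i1} - Y_{i0}|.$$

By continuity of $f$, $f_i$ is also continuous, hence

$$\hat Z_{ij(n)} - Z_{ij(n)} \xrightarrow[n\to\infty]{} 0.$$

For $t \in \{s_0^i,\dots,s_{r+1}^i\}$, we get similar results, leading to the conclusion. □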

As a conclusion, we can extend our results to the case of nonmonotone functions
since we transform the problem into a monotone warping problem with the same warping
functions. These functions $h_i$, $i = 1,\dots,m$, can be estimated by our methodology using
the new observations $\hat Z_{ij}$, $\forall i = 1,\dots,m$, $\forall j = 0,\dots,n$. The estimator can be written in
the following form

$$\widehat{\phi \circ h_{i_0}^{-1}}(t) = \frac{1}{m-1} \sum_{i=1,\ i \ne i_0}^{m} T_i, \qquad (15)$$

with

$$T_i = \arg\min_{t_j \in \{t_0,\dots,t_n\}} |\hat Z_{ij} - \hat Z_{i_0 j_0}|$$

and

$$t_{j_0} = \arg\min_{t_j \in \{t_0,\dots,t_n\}} |t_j - t|.$$

3.2 Dealing with noisy data 

Although theoretical asymptotic results are only given in a noise-free framework, we can
still handle the case where the data are observed in the standard regression model

$$Y_{ij} = f \circ h_i^{-1}(t_j) + \varepsilon_{ij}, \quad i = 1,\dots,m,\ j = 0,\dots,n, \qquad (16)$$

with $\varepsilon_{ij} \sim \mathcal{N}(0, \sigma^2)$. To be able to apply our algorithm, we first denoise the data. For
this, we estimate separately each function $f_i$, $i = 1,\dots,m$, by a kernel estimator. From a
practical point of view, we describe the estimation procedure used in the simulations.



1. $\forall i \in \{1,\dots,m\}$, $f_i$ is estimated as follows. At the two endpoints,

$$\hat f_i(t_0) = \frac{1}{m} \sum_{l=1}^{m} Y_{l0} \xrightarrow[m\to+\infty]{} f(t_0), \qquad \hat f_i(t_n) = \frac{1}{m} \sum_{l=1}^{m} Y_{ln} \xrightarrow[m\to+\infty]{} f(t_n).$$

Given a Gaussian kernel, $\forall j \in \{1,\dots,n-1\}$, $f_i(t_j)$ is estimated by a kernel estimator
$\hat f_i(t_j)$ with bandwidth $\nu_i$.

The bandwidths $\nu_i$ are to be properly chosen.

2. The estimation procedure can be conducted using the denoised observations $\hat f_i(t_j)$,
leading to new estimates $\hat f(t)$ of the structural expectation $f \circ \phi^{-1}$.

We point out that the efficiency of the procedure heavily relies on a proper choice of the
bandwidths $\nu_i$, $i = 1,\dots,m$. Cross-validation techniques do not provide good results since
the aim is not to get a good estimation of the function but only a good separation of the
different functions. Hence oversmoothing the data is not a drawback in this setting. So,
the smoothing parameters $\nu = \nu_i$, $\forall i = 1,\dots,m$, are obtained by minimizing the following
matching criterion

m n 

V = argmin ^ ^ |f (t^) - f{tj) , 

i=l j=0 

over a grid L. Practical applications of this algorithm are given in Figure 



4 Numerical Study 

In this part, we estimate the structural expectation, using both the proposed method
and the analytic registration approach developed in [9]. Note that an alternative to
analytic registration is provided by the so-called landmark registration approach in [5],
but this approach requires the determination of landmarks (such as local extrema) which
can be difficult in our simulations. Hence, this method is not implemented here. First,
results on simulated data are given in order to compare these two methods. Then, an
application of our methodology is given for a real data set.

4.1 Simulations

Two simulation studies are carried out in this section. The first one involves a strictly
increasing function and the second one involves a nonmonotone function.



Warped functions. Let $f$ and $g$ (see Figures 1 and 2) be defined by

$$\forall t \in [0,1],\quad f(t) = \sin(3\pi t) + 3\pi t \quad\text{and}\quad g(t) = \frac{\sin(6\pi t)}{6\pi t}.$$

These two functions will be warped by the following random warping process.



Warping processes. The stochastic warping functions $H_i$, $i = 1,\dots,m$, are simulated
by the iterative process described below.

Let $\varepsilon > 0$. First, for all $i = 1,\dots,m$, let $H_i^{(0)}$ be the identity function. Then,
the warping functions $H_i^{(N)}$, $i = 1,\dots,m$, are successively obtained from the
functions $H_i^{(0)}$, $i = 1,\dots,m$, by iterating $N$ times the following process:

1. Let $U$ be a uniformly distributed random variable on $[10\varepsilon, 1 - 10\varepsilon]$.

2. Let $V_i$, $i = 1,\dots,m$, be independent and identically uniformly distributed
variables on $[U - \varepsilon, U + \varepsilon]$.

3. For all $i = 1,\dots,m$, the warping function $H_i^{(k)}$ is warped as follows:

$$H_i^{(k+1)} = W_i \circ H_i^{(k)},$$

where $W_i$ is defined by

$$W_i(t) = \begin{cases} \dfrac{V_i}{U}\, t & \text{if } 0 \le t \le U, \\[2mm] \dfrac{1 - V_i}{1 - U}\, t + \dfrac{V_i - U}{1 - U} & \text{if } U < t \le 1. \end{cases}$$

We have thus defined strictly increasing stochastic functions $H_i : [0,1] \to [0,1]$, such
that

$$\forall t \in [0,1],\quad E(H_i(t)) = \phi(t) = t.$$

Hence, the warping process is centered, in the sense that the structural expectation $f \circ \phi^{-1}$
is equal to $f$.
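
The iterative construction can be sketched in Python as follows. The sketch assumes that $W_i$ is the piecewise-linear map fixing 0 and 1 and sending $U$ to $V_i$, and it evaluates the warping functions on a regular grid; the function name `simulate_warpings` and its parameters (`n_iter`, `grid_size`, `seed`) are hypothetical.

```python
import numpy as np

def simulate_warpings(m=30, n_iter=3000, eps=0.005, grid_size=101, seed=0):
    """Simulate m random warping functions of [0, 1] on a regular grid."""
    rng = np.random.default_rng(seed)
    t = np.linspace(0.0, 1.0, grid_size)
    H = np.tile(t, (m, 1))                    # start from the identity
    for _ in range(n_iter):
        # Step 1: common break point U.
        U = rng.uniform(10 * eps, 1 - 10 * eps)
        # Step 2: individual targets V_i around U.
        V = rng.uniform(U - eps, U + eps, size=(m, 1))
        # Step 3: compose with W_i (assumed piecewise linear,
        # W_i(0) = 0, W_i(U) = V_i, W_i(1) = 1).
        left = V / U * H
        right = (1 - V) / (1 - U) * H + (V - U) / (1 - U)
        H = np.where(H <= U, left, right)
    return t, H
```

Each row of `H` then holds the values of one simulated warping function on the grid `t`.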

Our simulated warping functions $H_i$, $i = 1,\dots,m$, have been obtained by using the
above iterative process with $m = 30$, $N = 3000$ and $\varepsilon = 0.005$. These processes are shown
in Figure 1 and in Figure 2. In these figures, we can see the very large phase variations.
For example, point 0.2 is warped between approximately 0.05 and 0.35 for the first case,
and between approximately 0.00 and 0.40 for the second case.

Simulated data. Finally, simulated data are carried out on an equally spaced grid as 
follows : 

and 

Y,^=g{H-' (t,))+e,, 

with $t_j = j/n$, $j = 0,1,\dots,n$, $n = 100$ points. Simulated warped functions are respectively
shown in Figures 1 and 2.

Figures 1 and 2 show the functions $f$ and $g$ and the mean functions of the warped
functions. We can easily see that the classical mean is not adapted to describe the data.
In particular, the mean function of the first simulated data set does not reflect the flat
component of the function $f$ (in the range [0.2,0.4]) which nevertheless appears in each individual
warped function. In the same way, the mean function of the second simulated data set
does not reflect the structure of the individual curves. The classical mean attenuates
curve variations.

The estimated structural expectations with both the analytic registration approach
and the proposed method are shown in Figures 1 and 2 (bottom right figures). We
can easily see that estimations with our proposed method are closer to the structural
expectations $f$ and $g$. These results can be explained as follows:

• For the first simulated data set, the analytic registration approach does not directly 
work on the strictly increasing functions but on first derivatives. However, theoret- 
ically registering a given function data set is not the same issue as the registration 
of first derivatives of these functions. 

• The analytic registration approach uses the mean curve to register all the functions. 
Due to the drawbacks of the mean curve when dealing with large deformations (for 
instance in the second data set, where the result is a very flat mean curve), the 
structural mean approach provides better results. 

• For both first and second simulated data sets, the analytic registration approach 
works on estimated functions and not directly with given data, which implies an 
additional source of error. 







Figure 1: Function $f$ is shown on top left figure (solid line). Simulated warping processes
are shown on top right figure. Simulated warped functions are shown on bottom left
figure. The classical mean of these functions is drawn on top left figure (dotted line).
Finally, estimations of $f$ with the analytic registration procedure (dashed line) and with
our method (dotted-dashed line) are shown on bottom right figure.

4.2 A concrete application : multiple referees and equity 

The field of application of the results presented in this paper is large. Here we consider
an example in the academic field: how to guarantee equality in an exam with several
different referees?

Consider an examination with a large number of candidates, such that it is impossible
to evaluate the candidates one after another. So the students are divided into $m$ groups
and a board of examiners is charged to grade the students of one group. The evaluation
is performed by assigning a score from 0 to 20.

Figure 2: Function $g$ is shown on top left figure (solid line). Simulated warping processes
are shown on top right figure. Simulated warped functions are shown on bottom left
figure. The classical mean of these functions is drawn on top left figure (dotted line).
Finally, estimations of $g$ with the analytic registration procedure (dashed line) and with
our method (dotted-dashed line) are shown on bottom right figure.

The $m$ different boards of examiners are supposed to behave the same way in order
to respect the equality among the candidates. Moreover it is assumed that the sampling
of the candidates is perfect in the sense that it is done in such a way that each board of
examiners evaluates candidates with the same global level. Hence, if all the examiners
had the same requirement levels, the distribution of the ranks would be the same for all
the boards of examiners. Here, we aim at balancing the effects of the differences between
the examiners, and gaining equity for the candidates, and at studying real data provided
by the French competitive teacher exam Agregation de mathematiques.

This situation can be modeled as follows. Define for each group $i = 1,\dots,m$, the
scores obtained by students as $X^i = \{X_j^i \in (0,20),\ j = 1,\dots,n_i\}$. Let $f_i$, $i = 1,\dots,m$, be
the distribution function of the group $i$ defined by

$$f_i(t) = \frac{1}{n_i} \sum_{j=1}^{n_i} \mathbf{1}_{X_j^i \le t}.$$

Under the assumption that all the examiners give the same ranking to two candidates with
the same level, which will be denoted $H_0$, the random variables $X^1,\dots,X^m$
are equally distributed. Figure 3 shows the distinct empirical functions for all the $m = 13$
groups.

First, we test for every couple of sets $(i,j)$ the assumption $X^i \sim X^j$. For this we
perform the following homogeneity test, see for instance [3]. Define for all $k = 1,\dots,20$,
$n_k^i = \sum_{l=1}^{n_i} \mathbf{1}_{X_l^i = k}$ and $n_k^j = \sum_{l=1}^{n_j} \mathbf{1}_{X_l^j = k}$. Define $\hat p_k = \frac{n_k^i + n_k^j}{n_i + n_j}$. Finally set $d_k^i = n_i \hat p_k$, $d_k^j = n_j \hat p_k$ and

$$D_n^i = \sum_{k=1}^{20} \frac{(d_k^i - n_k^i)^2}{d_k^i}, \qquad D_n^j = \sum_{k=1}^{20} \frac{(d_k^j - n_k^j)^2}{d_k^j}.$$

Under $H_0$, $D_n = D_n^i + D_n^j$ converges in distribution to a $\chi^2$ distribution and almost surely
to $+\infty$ under the assumption that the two laws are different.
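
A sketch of the test statistic is given below. It assumes the usual chi-square homogeneity statistic for two samples of integer scores, with pooled class frequencies; the function name `homogeneity_statistic` is hypothetical, and classes that are empty in both samples are skipped to avoid dividing by zero, which is a choice of this sketch.

```python
import numpy as np

def homogeneity_statistic(x_i, x_j, scores=range(1, 21)):
    """Chi-square homogeneity statistic D_n for two samples of scores."""
    x_i = np.asarray(x_i)
    x_j = np.asarray(x_j)
    # Class counts n_k^i and n_k^j for each possible score k.
    n_ki = np.array([(x_i == k).sum() for k in scores], dtype=float)
    n_kj = np.array([(x_j == k).sum() for k in scores], dtype=float)
    n_i, n_j = n_ki.sum(), n_kj.sum()
    # Pooled class frequencies.
    p_hat = (n_ki + n_kj) / (n_i + n_j)
    keep = p_hat > 0          # assumption: ignore classes empty in both samples
    d_i = n_i * p_hat[keep]   # expected counts in sample i
    d_j = n_j * p_hat[keep]   # expected counts in sample j
    D_i = ((d_i - n_ki[keep]) ** 2 / d_i).sum()
    D_j = ((d_j - n_kj[keep]) ** 2 / d_j).sum()
    return D_i + D_j
```

The statistic is zero when the two samples have identical class counts and grows as the two score distributions differ.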

We have $n = 4000$ candidates and $m = 13$ examiners. We test the assumption $H_0$ for
all the different pairs of groups. In 60% of cases and for a level equal to 5%, we reject the
assumption that the rankings follow the same distribution.

As a consequence the following procedure is proposed. We aim at finding the average
way of ranking, with respect to the ranks that were given within the 13 groups of
candidates. For this, assume that there is such an average empirical distribution function
and that the empirical distribution function of each group is warped by a random process
from this reference distribution function. Hence a good choice is given by the structural
expectation, since the functions $f_j$, $j = 1,\dots,13$, are increasing so that Theorem 2.1 may
apply.

Figure 3 shows the different empirical distributions and the structural empirical
distribution function. To each dot on the empirical distribution function, corresponding to
the rank given within one group, is associated its corresponding structural rank. It is
obtained by simply projecting onto the structural empirical distribution function. As a
result we obtain rescaled structural ranks corresponding to the rank each candidate would
have obtained if judged by an average board of examiners, leading to a
fairer ranking. Indeed the differences of judgment between the subgroups are revised
according to the average judgment of all the groups of examiners.

In conclusion, structural expectation provides a data-driven pattern, which plays the
role of a reference pattern to which all the different curves can be compared. We
successfully applied this method to rescale the ranks obtained by candidates evaluated by
different boards of examiners. This use is not restrictive since the method can be used to provide mean
patterns for other types of functional data in various fields such as econometrics, biology
or data traffic for instance.



A Proofs and technical lemmas 

In practice, functions $(f_i)_{i \in \{1,\dots,m\}}$ are evaluated on a discrete interval of $\mathbb{R}$ as described
in Section 1. In order to prove Theorem 2.1 and Theorem 2.5 we first study asymptotic
results on the theoretical continuous model, i.e.

$$f_i(t) = f \circ h_i^{-1}(t), \quad i \in \{1,\dots,m\},\ t \in [a,b], \qquad (18)$$

where $H$ is defined in the same way as in model (2). In a second step, we will extend the
proofs to the discretized model and prove results of Section 3.

So consider that all functions are measured on the entire interval $[a,b]$. After asymptotic
results are proved for this continuous model in Section A.1, we use these results to
prove Theorem 2.1 and Theorem 2.5 (and subsequently Theorem 2.2) in Section A.2.

Figure 3: Empirical distribution functions (left figure) and structural expectation
estimation (right figure).

A.1 Asymptotic results for the continuous model

For the continuous model (18), we provide asymptotic results (analogous to Theorem
2.1 and Theorem 2.5) and their proofs.

A.1.1 Estimator and asymptotic results for the inverse of the structural expectation 

Considering the continuous model (18), we define an empirical estimator of the inverse of 
the structural expectation in the following way. Set 

$$\overline{f_{ES}^{-1}} = \frac{1}{m} \sum_{i=1}^{m} f_i^{-1}.$$

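The following theorem gives consistency and asymptotic normality of this estimator. 

Theorem A.1. Under assumption (2), the estimator $\overline{f_{ES}^{-1}}$ converges almost surely to $f_{ES}^{-1}$: 

$$\left\| \overline{f_{ES}^{-1}} - f_{ES}^{-1} \right\|_\infty \xrightarrow[m \to \infty]{a.s.} 0.$$

Moreover, assume that assumptions (4) and (5) hold. Then $\sqrt{m}\,\big(\overline{f_{ES}^{-1}} - f_{ES}^{-1}\big)$ converges 
weakly toward a zero-mean Gaussian process $G$: 

$$\sqrt{m}\,\Big(\overline{f_{ES}^{-1}} - f_{ES}^{-1}\Big) \xrightarrow[m \to \infty]{\mathcal{D}} G,$$

where the covariance function of $G$ is defined, for all $(s,t) \in [f(a), f(b)]^2$, by 

$$\mathrm{Cov}(G(s), G(t)) = r\big(f^{-1}(s), f^{-1}(t)\big).$$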
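
The consistency statement can be illustrated numerically. The following is a minimal sketch under assumptions that are not part of the paper: the pattern $f(t) = t^2$ on $[0,1]$, the warping family $h(t) = t + a\,t(1-t)$ with $a$ uniform on $(-0.5, 0.5)$, and all function names are hypothetical choices made for illustration. With these choices $\varphi(t) = E[H(t)] = t$, so the target $f_{ES}^{-1}(y) = \varphi \circ f^{-1}(y)$ equals $\sqrt{y}$.

```python
import math
import random

def make_warp(a):
    """Hypothetical warping h(t) = t + a*t*(1-t) on [0, 1]; increasing when |a| < 1."""
    return lambda t: t + a * t * (1.0 - t)

def pattern_inverse(y):
    """Inverse of the hypothetical pattern f(t) = t**2 on [0, 1]."""
    return math.sqrt(y)

def mean_inverse(warps, y):
    """Average over i of f_i^{-1}(y), using f_i^{-1} = h_i o f^{-1}."""
    return sum(h(pattern_inverse(y)) for h in warps) / len(warps)

rng = random.Random(0)
# The parameter a is centred, so phi(t) = E[H(t)] = t and f_ES^{-1}(y) = sqrt(y).
warps = [make_warp(rng.uniform(-0.5, 0.5)) for _ in range(2000)]
estimate = mean_inverse(warps, 0.25)
target = math.sqrt(0.25)
```

In this toy setting the average of the individual inverses should approach the target as the number of curves grows, which is the behaviour the first part of the theorem describes.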



Proof. Almost sure convergence of the estimator $\overline{f_{ES}^{-1}}$ is directly deduced from Corollary 7.10 
(p. 189) in [7]. This corollary is an extension of the Strong Law of Large Numbers to 
Banach spaces. 

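For all $i \in \{1,\dots,m\}$, the functions $f_i^{-1}$ are obviously strictly increasing, hence 
$\overline{f_{ES}^{-1}}$ is strictly increasing, and we have 

$$\overline{f_{ES}^{-1}} - f_{ES}^{-1} = \frac{1}{m}\sum_{i=1}^{m} f_i^{-1} - \varphi \circ f^{-1} = \frac{1}{m}\sum_{i=1}^{m} \big(h_i \circ f^{-1} - \varphi \circ f^{-1}\big) = \frac{1}{m}\sum_{i=1}^{m} (h_i - \varphi) \circ f^{-1}.$$

For all $i \in \mathbb{N}^*$, let $X_i = (h_i - \varphi) \circ f^{-1}$ and, for all $m \in \mathbb{N}^*$, let 

$$S_m = \sum_{i=1}^{m} X_i.$$

The $(X_i)_{i \in \{1,\dots,m\}}$ are $B$-valued random variables, where $B = C([f(a), f(b)])$ is a separable 
Banach space. Moreover, the dual space of $B$ is the set of bounded measures on $[f(a), f(b)]$ 
([10]). Hence, our framework corresponds to Chapter 7 of [7] and thus we can apply 
Corollary 7.10. Indeed, we have 

$$E(\|X_i\|_\infty) < +\infty \quad \text{and} \quad E(X_i) = 0,$$

then 

$$\frac{S_m}{m} \xrightarrow[m \to +\infty]{a.s.} 0,$$

which proves almost sure convergence. 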

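We now turn to weak convergence. From the multivariate CLT, for any $k \in \mathbb{N}^*$ and 
fixed $(y_1,\dots,y_k) \in [f(a), f(b)]^k$, 

$$\sqrt{m}\left[\begin{pmatrix} \overline{f_{ES}^{-1}}(y_1) \\ \vdots \\ \overline{f_{ES}^{-1}}(y_k) \end{pmatrix} - \begin{pmatrix} f_{ES}^{-1}(y_1) \\ \vdots \\ f_{ES}^{-1}(y_k) \end{pmatrix}\right] \xrightarrow[m \to \infty]{\mathcal{D}} \mathcal{N}_k(0, \Gamma),$$

where the covariance matrix $\Gamma = (\Gamma_{ij})_{i,j}$ is given by $\Gamma_{ij} = \mathrm{cov}\big(F^{-1}(y_i), F^{-1}(y_j)\big) = 
\mathrm{cov}\big(H(f^{-1}(y_i)), H(f^{-1}(y_j))\big)$, $i,j = 1,\dots,k$. It remains to show that $\{\sqrt{m}\,(\overline{f_{ES}^{-1}} - f_{ES}^{-1})\}$ 
is tight. We verify the moment condition stated by [13, Example 2.2.12]. 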

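$$\begin{aligned}
& E\left|\sqrt{m}\,\big(\overline{f_{ES}^{-1}}(s) - f_{ES}^{-1}(s)\big) - \sqrt{m}\,\big(\overline{f_{ES}^{-1}}(t) - f_{ES}^{-1}(t)\big)\right|^2 \\
&\quad = m\, E\left|\frac{1}{m}\sum_{i=1}^{m}\big(f_i^{-1}(s) - EF^{-1}(s)\big) - \frac{1}{m}\sum_{i=1}^{m}\big(f_i^{-1}(t) - EF^{-1}(t)\big)\right|^2 \\
&\quad = E\left|F^{-1}(s) - EF^{-1}(s) - \big(F^{-1}(t) - EF^{-1}(t)\big)\right|^2,
\end{aligned}$$

where the last equality follows from the fact that the $h_i$'s are i.i.d. Then, from (4) and 
(5), we get that 

$$E\left|\sqrt{m}\,\big(\overline{f_{ES}^{-1}}(s) - f_{ES}^{-1}(s)\big) - \sqrt{m}\,\big(\overline{f_{ES}^{-1}}(t) - f_{ES}^{-1}(t)\big)\right|^2 \le C_1 C_2 |s-t|^2,$$

which completes the proof. □ 
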



A.1.2 Estimator and asymptotic results for an individual warping function 

For the continuous model, we define an empirical estimator of the individual warping 
function $\varphi \circ h_{i_0}^{-1}$ ($i_0 \in \{1,\dots,m\}$) as follows. Conditionally on $F_{i_0} = f_{i_0}$, for all $t \in [a,b]$, 
let 

$$\overline{\varphi \circ h_{i_0}^{-1}}(t) = \frac{1}{m-1}\sum_{\substack{i=1 \\ i \neq i_0}}^{m} f_i^{-1} \circ f_{i_0}(t). \qquad (20)$$

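The following theorem gives consistency and asymptotic normality of this estimator. 

Theorem A.2. Under assumption (2), the estimator $\overline{\varphi \circ h_{i_0}^{-1}}$ converges almost surely to $\varphi \circ h_{i_0}^{-1}$: 

$$\left\| \overline{\varphi \circ h_{i_0}^{-1}} - \varphi \circ h_{i_0}^{-1} \right\|_\infty \xrightarrow[m \to \infty]{a.s.} 0.$$

Let $n = m^{1/2+\alpha}$ (with $\alpha > 0$) and assume that (4) and (6) hold. Then 
$\sqrt{m}\,\big(\overline{\varphi \circ h_{i_0}^{-1}} - \varphi \circ h_{i_0}^{-1}\big)$ converges weakly to a zero-mean Gaussian process $Z$, 
with covariance function defined for all $(s,t) \in [a,b]^2$ by 

$$\mathrm{Cov}(Z(s), Z(t)) = r\big(h_{i_0}^{-1}(s), h_{i_0}^{-1}(t)\big).$$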
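
A small simulation can make the estimator of the individual warping function concrete. This is a sketch under assumptions not taken from the paper: the warping family $h(t) = t + a\,t(1-t)$ on $[0,1]$ with centred $a$ (so that $\varphi(t) = t$), the bisection helper, and all names are hypothetical. It uses the identity $f_i^{-1} \circ f_{i_0} = h_i \circ h_{i_0}^{-1}$, which holds for any pattern $f$ in model (18).

```python
import random

def make_warp(a):
    """Hypothetical warping h(t) = t + a*t*(1-t) on [0, 1]; increasing when |a| < 1."""
    return lambda t: t + a * t * (1.0 - t)

def invert_increasing(g, y, lo=0.0, hi=1.0, iters=60):
    """Bisection inverse of an increasing function g on [lo, hi]."""
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if g(mid) < y:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def warping_estimate(warps, i0, t):
    """(1/(m-1)) * sum over i != i0 of h_i(h_{i0}^{-1}(t)), i.e. f_i^{-1} o f_{i0}(t)."""
    u = invert_increasing(warps[i0], t)
    others = [h(u) for i, h in enumerate(warps) if i != i0]
    return sum(others) / len(others)

rng = random.Random(1)
warps = [make_warp(rng.uniform(-0.5, 0.5)) for _ in range(2000)]
i0, t = 0, 0.3
estimate = warping_estimate(warps, i0, t)
# Here phi(t) = t, so the target phi o h_{i0}^{-1}(t) is simply h_{i0}^{-1}(t).
target = invert_increasing(warps[i0], t)
```

The curve indexed by $i_0$ is left out of the average, which mirrors the $1/(m-1)$ normalisation in (20).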
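
Proof. Let $i_0 \in \{1,\dots,m\}$. Given $F_{i_0} = f_{i_0}$, 

$$\overline{\varphi \circ h_{i_0}^{-1}} - \varphi \circ h_{i_0}^{-1} = \frac{1}{m-1}\sum_{\substack{i=1 \\ i \neq i_0}}^{m} (h_i - \varphi) \circ h_{i_0}^{-1}.$$

Noting also that $\varphi \circ h_{i_0}^{-1} = f_{ES}^{-1} \circ f_{i_0}$, consistency of $\overline{\varphi \circ h_{i_0}^{-1}}$ follows by the same arguments 
as in the proof of Theorem A.1. 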

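We now turn to weak convergence. From the multivariate CLT, for any $k \in \mathbb{N}^*$ and 
fixed $(t_1,\dots,t_k) \in [a,b]^k$, 

$$\sqrt{m}\left[\begin{pmatrix} \overline{\varphi \circ h_{i_0}^{-1}}(t_1) \\ \vdots \\ \overline{\varphi \circ h_{i_0}^{-1}}(t_k) \end{pmatrix} - \begin{pmatrix} \varphi \circ h_{i_0}^{-1}(t_1) \\ \vdots \\ \varphi \circ h_{i_0}^{-1}(t_k) \end{pmatrix}\right] \xrightarrow[m \to \infty]{\mathcal{D}} \mathcal{N}_k(0, \Gamma_0),$$

where the covariance matrix $\Gamma_0 = (\Gamma_{0,ij})_{i,j}$ is given by $\Gamma_{0,ij} = \mathrm{cov}\big(H(h_{i_0}^{-1}(t_i)), H(h_{i_0}^{-1}(t_j))\big) = 
r\big(h_{i_0}^{-1}(t_i), h_{i_0}^{-1}(t_j)\big)$, $i,j = 1,\dots,k$. It remains to show that $\{\sqrt{m}\,(\overline{\varphi \circ h_{i_0}^{-1}} - \varphi \circ h_{i_0}^{-1})\}$ is 
tight. Again, we verify the moment condition stated by [13, Example 2.2.12]. 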



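$$\begin{aligned}
& E\left|\sqrt{m}\,\big(\overline{\varphi \circ h_{i_0}^{-1}}(s) - \varphi \circ h_{i_0}^{-1}(s)\big) - \sqrt{m}\,\big(\overline{\varphi \circ h_{i_0}^{-1}}(t) - \varphi \circ h_{i_0}^{-1}(t)\big)\right|^2 \\
&\quad = \frac{m}{(m-1)^2}\sum_{\substack{i=1 \\ i \neq i_0}}^{m} E\left|(h_i - \varphi) \circ h_{i_0}^{-1}(s) - (h_i - \varphi) \circ h_{i_0}^{-1}(t)\right|^2 \\
&\quad = \frac{m}{m-1}\, E\left|(H - \varphi) \circ h_{i_0}^{-1}(s) - (H - \varphi) \circ h_{i_0}^{-1}(t)\right|^2.
\end{aligned}$$

Now, from assumptions (4) and (6), we get that 

$$E\left|\sqrt{m}\,\big(\overline{\varphi \circ h_{i_0}^{-1}}(s) - \varphi \circ h_{i_0}^{-1}(s)\big) - \sqrt{m}\,\big(\overline{\varphi \circ h_{i_0}^{-1}}(t) - \varphi \circ h_{i_0}^{-1}(t)\big)\right|^2$$

is bounded by a constant multiple of $|s-t|^2$, which completes the proof. □ 
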



A.2 Proofs of asymptotic results 

We now use Theorem A.1 (given for the continuous model) to prove Theorem 2.1. 

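Proof of Theorem 2.1. Let $y \in [f(a), f(b)]$. Throughout, $\overline{f_{ES}^{-1}}$ denotes the continuous-model 
estimator of Section A.1 and $\widehat{f_{ES}^{-1}}$ the estimator built from the discretized observations. 
The $n$ observation times are equidistant and, for each $i = 1,\dots,m$, $f_i$ is almost surely 
increasing, hence $f_i^{-1}(y) - \frac{1}{n} \le T_i(y) \le f_i^{-1}(y) + \frac{1}{n}$. This implies that, almost surely, 

$$\overline{f_{ES}^{-1}}(y) - \frac{1}{n} \le \widehat{f_{ES}^{-1}}(y) \le \overline{f_{ES}^{-1}}(y) + \frac{1}{n}. \qquad (21)$$

Since the RHS of (21) gives 

$$\widehat{f_{ES}^{-1}}(y) - f_{ES}^{-1}(y) \le \overline{f_{ES}^{-1}}(y) - f_{ES}^{-1}(y) + \frac{1}{n},$$

we get that 

$$\widehat{f_{ES}^{-1}}(y) - f_{ES}^{-1}(y) \le \left\|\overline{f_{ES}^{-1}} - f_{ES}^{-1}\right\|_\infty + \frac{1}{n}.$$

A similar argument holds for the LHS of (21), and finally 

$$\left\|\widehat{f_{ES}^{-1}} - f_{ES}^{-1}\right\|_\infty \xrightarrow[m,n \to \infty]{a.s.} 0,$$

by Theorem A.1. 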
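
Let $n = m^{1/2+\alpha}$ ($\alpha > 0$). From (21), we get that, almost surely, 

$$\left\|\sqrt{m}\,\big(\widehat{f_{ES}^{-1}} - \overline{f_{ES}^{-1}}\big)\right\|_\infty \le \frac{\sqrt{m}}{n} = m^{-\alpha}.$$

Since 

$$\left\|\sqrt{m}\,\big(\widehat{f_{ES}^{-1}} - f_{ES}^{-1}\big) - \sqrt{m}\,\big(\overline{f_{ES}^{-1}} - f_{ES}^{-1}\big)\right\|_\infty = \left\|\sqrt{m}\,\big(\widehat{f_{ES}^{-1}} - \overline{f_{ES}^{-1}}\big)\right\|_\infty,$$

we get that $\sqrt{m}\,\big(\widehat{f_{ES}^{-1}} - f_{ES}^{-1}\big) - \sqrt{m}\,\big(\overline{f_{ES}^{-1}} - f_{ES}^{-1}\big)$ converges almost surely to 0 as $m$ tends 
to infinity. 

Combining Theorem A.1 and Theorem 4.1 in [2], it follows that $\sqrt{m}\,\big(\widehat{f_{ES}^{-1}} - f_{ES}^{-1}\big) \xrightarrow[m \to \infty]{\mathcal{D}} G$. 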
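
The two ingredients of this proof, the $1/n$ discretization bound and the rate $\sqrt{m}/n = m^{-\alpha}$, can be checked on a toy example. The sketch below rests on assumptions that are not stated in this appendix: $T_i(y)$ is taken to be the observation time whose observed value is closest to $y$, the design is $t_j = j/n$ on $[0,1]$, and the curve $f_i(t) = t^2$ and all names are hypothetical.

```python
def nearest_time(f_i, y, n):
    """Grid time t_j = j/n (j = 0..n) whose observed value f_i(t_j) is closest to y."""
    grid = [j / n for j in range(n + 1)]
    return min(grid, key=lambda t: abs(f_i(t) - y))

def f_i(t):
    """Hypothetical increasing curve on [0, 1]."""
    return t ** 2

def f_i_inv(y):
    """Exact inverse of the hypothetical curve."""
    return y ** 0.5

n = 50
levels = (0.1, 0.37, 0.5, 0.93)
# Distance between the discretized inverse and the exact inverse at each level.
errors = [abs(nearest_time(f_i, y, n) - f_i_inv(y)) for y in levels]
max_error = max(errors)

# With n = m**(1/2 + alpha), the scaled error sqrt(m)/n equals m**(-alpha).
alpha = 0.25
scaled = [m ** 0.5 / m ** (0.5 + alpha) for m in (10, 1000, 100000)]
```

Because the curve is increasing, the nearest observed value is attained at one of the two grid neighbours of the exact inverse, which is why the error cannot exceed the grid spacing in this design.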
We now turn to the proof of Lemma 2.3. 

Proof of Lemma 2.3. Consider first the continuous model (18), and define 

^ m 



1\ D 



G. 



i=l 



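Using similar arguments as in the proof of Theorem A.1, we get that 

$$\left\|\overline{\gamma \circ f^{-1}} - \gamma \circ f^{-1}\right\|_\infty \xrightarrow[m \to \infty]{a.s.} 0.$$

Now, since $|T_i(y) - f_i^{-1}(y)| \le \frac{1}{n}$, we obtain by straightforward calculations that, almost 
surely, 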

n 171 ^ " ^ 



1=1 



i=l 



n m 



which implies that — YliLi Ti-fi ^{y) — 1 ° f ^(y) — — — ^ 0, from which we deduce that 



m,n—*oo 



From \Ti{y) - /• ^ ^, we also get that 



i=l 



1=1 



i=l 



that is, ^ 7 o f'^{y) - ^ Xllli ^i-fi ^(y) + 7 o f'^iy) ^ from which we deduce that 
that 7 o f^^{y) — — — > 1° f~^{y)- Combining this with Theorem 12. II completes the proof. 



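Proof of Theorem 2.2. For $F$ an arbitrary function, let $F^{-1}$ denote its generalized inverse, 
defined by $F^{-1}(t) = \inf\{y : F(y) \ge t\}$. By Theorem 2.1, for all $y \in [f(a), f(b)]$, 

$$\widehat{f_{ES}^{-1}}(y) \xrightarrow[m,n \to \infty]{a.s.} f_{ES}^{-1}(y).$$

By Lemma 21.2 in [12], 

$$\big(\widehat{f_{ES}^{-1}}\big)^{-1}(t) \xrightarrow[m,n \to \infty]{a.s.} \big(f_{ES}^{-1}\big)^{-1}(t)$$

at every $t$ where $(f_{ES}^{-1})^{-1}$ is continuous. Since $f_{ES}^{-1}$ is continuous and strictly increasing, 
$(f_{ES}^{-1})^{-1}$ is a proper inverse and is equal to $f \circ \varphi^{-1}$, hence for all $t \in [a,b]$, 

$$\big(\widehat{f_{ES}^{-1}}\big)^{-1}(t) \xrightarrow[m,n \to \infty]{a.s.} f \circ \varphi^{-1}(t).$$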

Now, for all $t \in [a,b]$, $\big(\widehat{f_{ES}^{-1}}\big)^{-1}(t)$ can be rewritten as 

K{m,n) — 1 



fc=0 



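and by construction, letting $t \in (u_{k(m,n)}, u_{k(m,n)+1}]$ ($k(m,n) \in \mathcal{K}$), we have $v_{k(m,n)} \le 
\widehat{f_{ES}}(t) \le v_{k(m,n)+1}$. Combining this and the above equality yields that, for all $t \in 
(u_{k(m,n)}, u_{k(m,n)+1}]$, $\big|\widehat{f_{ES}}(t) - \big(\widehat{f_{ES}^{-1}}\big)^{-1}(t)\big| \le v_{k(m,n)+1} - v_{k(m,n)}$. Since $f_{ES}$ is continuous, 

$$v_{k(m,n)+1} - v_{k(m,n)} \xrightarrow[m,n \to \infty]{} 0, \quad \text{and} \quad \Big|\widehat{f_{ES}}(t) - \big(\widehat{f_{ES}^{-1}}\big)^{-1}(t)\Big| \xrightarrow[m,n \to \infty]{} 0.$$

It follows that, almost surely, $\widehat{f_{ES}}(t)$ converges to $f_{ES}(t)$. Uniform convergence finally 
follows from Dini's theorem. □ 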
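
The generalized inverse $F^{-1}(t) = \inf\{y : F(y) \ge t\}$ used in this proof behaves differently from a proper inverse on flat parts and at jumps of $F$. The following sketch evaluates it on a finite grid; the step function and every name in it are hypothetical, and restricting the infimum to grid points is a simplification made for illustration.

```python
def generalized_inverse(F, t, grid):
    """Smallest grid point y with F(y) >= t, or None if no grid point qualifies."""
    for y in grid:  # the grid must be sorted in increasing order
        if F(y) >= t:
            return y
    return None

def step(y):
    """Hypothetical nondecreasing step function with one flat part and two jumps."""
    if y < 0.4:
        return 0.0
    if y < 0.7:
        return 0.5
    return 1.0

grid = [k / 100 for k in range(101)]
at_flat = generalized_inverse(step, 0.5, grid)   # left end of the flat part at level 0.5
at_jump = generalized_inverse(step, 0.8, grid)   # level 0.8 is skipped by the jump at 0.7
```

A level that the function jumps over is mapped to the location of the jump, and a level attained on a whole interval is mapped to the left end of that interval.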

We now use Theorem A.2 (given for the continuous model) to prove Theorem 2.5. A 
preliminary definition and a lemma are needed. 
Let us define the modulus of continuity of $H$ as 

$$K_H(\delta) = \sup_{\substack{(s,t) \in [a,b]^2 \\ |s-t| \le \delta}} \left|H_1 \circ H_2^{-1}(s) - H_1 \circ H_2^{-1}(t)\right| \quad (\delta \ge 0),$$

where $H_1$ and $H_2$ are two independent copies of $H$. Since for all $\delta \ge 0$, $0 \le K_H(\delta) \le b - a$ 
almost surely, we can define $L_H^1(H_2, \delta)$ and $L_H^2(H_2, \delta)$ as $L_H^1(H_2, \delta) = E[K_H(\delta) \mid H_2]$ and 
$L_H^2(H_2, \delta) = E[K_H^2(\delta) \mid H_2]$ ($\delta \ge 0$), and we have $L_H^1(H_2, 0) = L_H^2(H_2, 0) = 0$. From 
continuity of $H$, it holds that $L_H^1(H_2, \cdot)$ and $L_H^2(H_2, \cdot)$ are right-continuous at 0. Given 
$H_2 = h$, this implies that for $\varepsilon > 0$, there exists $\eta_\varepsilon > 0$ such that $L_H^1(h, \eta_\varepsilon) < \varepsilon$. We shall 
use this result in proving the following lemma: 

Lemma A.3. Let $t \in [a,b]$. Under assumption (2), the following holds: 



sup 

te[a,b] 



m,n—*oo 



0. 



Proof. Letting $t \in [a,b]$, one easily shows that 
1 



m 



i=l 



hi o h-^ I t 



Koh.\t) 



This in turn implies that 



sup 

te[a,b] 



_ / x\ 1 

m - 1 ( n ) ' 

i=i ^ ' 



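where the $K^{i}_{h_{i_0}}\big(\frac{1}{n}\big)$ ($i \in \{1,\dots,m\} \setminus i_0$) are independent random variables distributed as 
$K_H\big(\frac{1}{n}\big) \mid H_2 = h_{i_0}$. Since $K_H(\cdot)$ is increasing, if $n \in \mathbb{N}$ is sufficiently large so that $\frac{1}{n} < \eta_\varepsilon$, 
we get that 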



sup 

tG[a,b] 



^ m 



i=l 

By the law of large numbers, we get that, almost surely, 



^ limsup sup 

m,n te[a,fe] 



This holds for any $\varepsilon > 0$, hence 



n 



«0 



sup 

t€[a,b] 



n 



0. 



□ 



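Proof of Theorem 2.5. Let $t \in [a,b]$. For each $i \in \{1,\dots,m\}$, $f_i$ is almost surely 
increasing, hence almost surely $f_i^{-1} \circ f_{i_0}(t_{j_0}) - \frac{1}{n} \le T_i \le f_i^{-1} \circ f_{i_0}(t_{j_0}) + \frac{1}{n}$. Recall that 
$j_0 = \arg\min_{j \in \{1,\dots,n\}} |t_j - t|$, hence $t - \frac{1}{n} \le t_{j_0} \le t + \frac{1}{n}$. Thus, combining the above two 
inequalities, we get that almost surely $f_i^{-1} \circ f_{i_0}\big(t - \frac{1}{n}\big) - \frac{1}{n} \le T_i \le f_i^{-1} \circ f_{i_0}\big(t + \frac{1}{n}\big) + \frac{1}{n}$, 
from which we deduce: 

$$\overline{\varphi \circ h_{i_0}^{-1}}\Big(t - \frac{1}{n}\Big) - \frac{1}{n} \le \widehat{\varphi \circ h_{i_0}^{-1}}(t) \le \overline{\varphi \circ h_{i_0}^{-1}}\Big(t + \frac{1}{n}\Big) + \frac{1}{n}. \qquad (22)$$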



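We shall focus on the upper bound in this inequality, since the same kind of argument 
holds for the lower bound. Straightforward calculation yields 

$$\sup_{t \in [a,b]} \Big(\widehat{\varphi \circ h_{i_0}^{-1}}(t) - \varphi \circ h_{i_0}^{-1}(t)\Big) \le \sup_{t \in [a,b]} \left|\overline{\varphi \circ h_{i_0}^{-1}}\Big(t + \frac{1}{n}\Big) - \overline{\varphi \circ h_{i_0}^{-1}}(t)\right| + \frac{1}{n} + \left\|\overline{\varphi \circ h_{i_0}^{-1}} - \varphi \circ h_{i_0}^{-1}\right\|_\infty,$$

where the RHS of this inequality tends to 0 as $m$ and $n$ tend to infinity, by Lemma A.3 
and Theorem A.2. It follows that 

$$\left\|\widehat{\varphi \circ h_{i_0}^{-1}} - \varphi \circ h_{i_0}^{-1}\right\|_\infty \xrightarrow[m,n \to \infty]{a.s.} 0.$$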



We now turn to weak convergence. Assume that $n = m^{1/2+\alpha}$ ($\alpha > 0$). From (22), it 
holds that 



^ sup 

tG[a,b] 



+ sup 

te[a,b] 



m\ 4)0 h^^^ I t 



n 



rrr 



^ 2Z„ 



where Zm = Zl^i (^)- Since 



vaiiZr, 



m 



L u I hj„ , — I s L u \ hi, 



m — 1 \ \ n I \ n 







(recall that $n = m^{1/2+\alpha}$ and that $L_H^1(h_{i_0}, \cdot)$ and $L_H^2(h_{i_0}, \cdot)$ are right-continuous at 0), we 
get that 



mi (ho h 



-1 

«0 



o h 



oo m^oo 
hence by Theorem A.2 and Theorem 4.1 in [2], $\sqrt{m}\,\big(\widehat{\varphi \circ h_{i_0}^{-1}} - \varphi \circ h_{i_0}^{-1}\big) \xrightarrow[m \to \infty]{\mathcal{D}} Z$. 

Proof of Lemma 2.6. Consider first the continuous model (18), and define, conditionally 
on $F_{i_0} = f_{i_0}$, 

" m — 1 ^-^ 



i=l 



Using similar arguments as in the proof of Theorem A.2, we get that 



as 



> 

oo m^oo 



0. 



Now, from $f_i^{-1} \circ f_{i_0}\big(t - \frac{1}{n}\big) - \frac{1}{n} \le T_i \le f_i^{-1} \circ f_{i_0}\big(t + \frac{1}{n}\big) + \frac{1}{n}$, we get the following 
inequality 



|T.-/r'%(t)i ^ 



+ 



2 

n 



Acting as in the proof of Lemma 2.3, it is fairly straightforward to show that for $t \in [a,b]$, 
From the above inequality, we also obtain that 



2 

n 



Summing over «€{!,..., m}\io, and then using convergence of ^ Zlj^io ^i-/r^ ° /io(^)) 
of 7 o h~^{t)^ together with right-continuity of L\j{hi^^ ■) and Lj^{hig, ■) at 0, yield that 
7 ° ^in^i^) — ~ — ^ 7 ° ^?Vi^(^)- This and convergence of o h^Mt) complete the proof of 

m,n— >oo 

Lemma 12.61 